Time Series Motif Discovery and Anomaly Detection Based on Subseries Join
نویسندگان
چکیده
Time series are composed of sequences of data items measured at typically uniform intervals. Time series arise frequently in many scientific and engineering applications, including finance, medicine, digital audio, and motion capture. Time series motifs are repeated similar subseries in one or multiple time series data. Time series anomalies are unusual subseries in one or multiple time series data. Finding motifs and anomalies in time series data are closely related problems and are useful in many domains, including medicine, motion capture, meteorology, and finance. This paper presents a novel approach for both the motif discovery problem and the anomaly detection problem. First, we use a subseries join operation to match similar subseries and to obtain similarity relationships among subseries of the time series data. The subseries join algorithm we use can efficiently and effectively tolerate noise, time-scaling, and phase shifts. Based on the similarity relationships found among subseries of the time series data, the motif discovery and anomaly detection problems can be converted to graph-theoretic problems solvable by known graphtheoretic algorithms. Experiments demonstrate the effectiveness of the proposed approach to discover motifs and anomalies in real-world time series data. Experiments also demonstrate that the proposed approach is efficient when applied to large time series datasets.
منابع مشابه
Motif and Anomaly Discovery of Time Series Based on Subseries Join
Time series motifs are repeated similar subseries in one or multiple time series data. Time series anomalies are unusual subseries in one or multiple time series data. Finding motifs and anomalies in time series data are closely related problems and are useful in many domains, including medicine, motion capture, meteorology, and finance. This work presents a novel approach for both the motif di...
متن کاملSubseries Join and Compression of Time Series Data Based on Non-uniform Segmentation
A time series is composed of a sequence of data items that are measured at uniform intervals. Many application areas generate or manipulate time series, including finance, medicine, digital audio, and motion capture. Efficiently searching a large time series database is still a challenging problem, especially when partial or subseries matches are needed. This thesis proposes a new definition of...
متن کاملAnomaly detection and motif discovery in symbolic representations of time series
The advent of the Big Data hype and the consistent recollection of event logs and real-time data from sensors, monitoring software and machine configuration has generated a huge amount of time-varying data in about every sector of the industry. Rule-based processing of such data has ceased to be relevant in many scenarios where anomaly detection and pattern mining have to be entirely accomplish...
متن کاملVisualizing and discovering non-trivial patterns in large time series databases
Data visualization techniques are very important for data analysis, since the human eye has been frequently advocated as the ultimate data-mining tool. However, there has been surprisingly little work on visualizing massive time series datasets. To this end, we developed VizTree, a time series pattern discovery and visualization system based on augmenting suffix trees. VizTree visually summariz...
متن کاملRobust and Accurate Anomaly Detection in ECG Artifacts Using Time Series Motif Discovery
Electrocardiogram (ECG) anomaly detection is an important technique for detecting dissimilar heartbeats which helps identify abnormal ECGs before the diagnosis process. Currently available ECG anomaly detection methods, ranging from academic research to commercial ECG machines, still suffer from a high false alarm rate because these methods are not able to differentiate ECG artifacts from real ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010